# WER optimization

Whisper Small El
Apache-2.0
This is an automatic speech recognition (ASR) model fine-tuned from the openai/whisper-small model for Greek speech recognition tasks, trained on 3,620 Greek samples from the Mozilla Common Voice 17.0 dataset.
Speech Recognition Transformers Other
W
mozilla-ai
94
1
XLSR WithLM Malayalam
Apache-2.0
This model is a fine-tuned version of facebook/wav2vec2-xls-r-300m on the IMaSC, Indic TTS Malayalam, and OpenSLR Malayalam training datasets, supporting automatic speech recognition for Malayalam.
Speech Recognition Transformers
X
kavyamanohar
19
4
Whisper Small Sk Cv11
Apache-2.0
Slovak speech recognition model fine-tuned on OpenAI Whisper-small, trained on the Common Voice 11.0 Slovak dataset
Speech Recognition Transformers Other
W
mikr
79
2
English Filipino Wav2vec2 L Xls R Test 04
Apache-2.0
This model is a fine-tuned version of jonatasgrosman/wav2vec2-large-xlsr-53-english on the filipino_voice dataset, designed for English-Filipino speech recognition tasks.
Speech Recognition Transformers
E
Khalsuu
21
0
English Filipino Wav2vec2 L Xls R Test
Apache-2.0
English-Filipino speech recognition model fine-tuned based on jonatasgrosman/wav2vec2-large-xlsr-53-english
Speech Recognition Transformers
E
Khalsuu
18
0
Wav2vec2 Base Toy Train Data Random Low Pass
Apache-2.0
This model is a speech recognition model fine-tuned on an unknown dataset based on facebook/wav2vec2-base, primarily used for Automatic Speech Recognition (ASR) tasks.
Speech Recognition Transformers
W
scasutt
29
0
Wav2vec2 Large Xlsr 53 Toy Train Data Masked Audio 10ms
Apache-2.0
Speech recognition model fine-tuned based on facebook/wav2vec2-large-xlsr-53, optimized on 10ms audio masked training data
Speech Recognition Transformers
W
scasutt
22
0
Wav2vec2 Xls R 300m Gl CV8
Apache-2.0
This model is a fine-tuned speech recognition model based on Facebook's wav2vec2-xls-r-300m on the Common Voice Galician (gl) dataset, achieving a word error rate (WER) of 20.8% on the test set.
Speech Recognition Transformers Other
W
emre
18
0
Xls R Kyrgiz Cv8
Apache-2.0
This model is a fine-tuned automatic speech recognition model based on facebook/wav2vec2-xls-r-300m on the Common Voice 8.0 Kyrgyz dataset
Speech Recognition Transformers Other
X
lucio
16
0
Wav2vec2 Large Xlsr 53 Hsb
Apache-2.0
Upper Sorbian speech recognition model fine-tuned from facebook/wav2vec2-large-xlsr-53, supporting 16kHz audio input
Speech Recognition Other
W
anuragshas
23
0
Wav2vec2 Xls R 300m Wolof Lm
MIT
This is a Wolof automatic speech recognition model fine-tuned based on facebook/wav2vec2-xls-r-300m, aimed at addressing the scarcity of Wolof language resources.
Speech Recognition Transformers Other
W
abdouaziiz
41
4
Xls R Ab Test
This model is an automatic speech recognition model fine-tuned on the Common Voice 7.0 AB dataset, based on the XLS-R dummy architecture
Speech Recognition Transformers Other
X
cahya
20
0
Wav2vec2 Random
An automatic speech recognition model fine-tuned on the TIMIT_ASR dataset based on the wav2vec2-base-random model
Speech Recognition Transformers
W
patrickvonplaten
16
0
Wav2vec2 Xlsr Breton
Apache-2.0
This model is a fine-tuned automatic speech recognition model for Breton based on facebook/wav2vec2-xls-r-1b.
Speech Recognition Transformers Other
W
sammy786
13
0
Wav2vec2 Xls R 300m Gn Cv8
Apache-2.0
This is an automatic speech recognition (ASR) model fine-tuned on the Common Voice 8 dataset based on the facebook/wav2vec2-xls-r-300m model, supporting Guarani (gn).
Speech Recognition Transformers Other
W
lgris
16
0
Sew Tiny Portuguese Cv8
Apache-2.0
This is a Portuguese automatic speech recognition model based on the SEW-tiny architecture, fine-tuned on the Common Voice 8 dataset, suitable for Portuguese speech recognition tasks.
Speech Recognition Transformers Other
S
lgris
16
0
Wav2vec2 Georgian Daytona
Apache-2.0
A Georgian speech recognition model fine-tuned based on facebook/wav2vec2-large-xlsr-53, trained on the Common Voice dataset
Speech Recognition Other
W
Temur
19
2
Wav2vec2 Large Xlsr Turkish Demo Colab
Apache-2.0
This model is a fine-tuned Turkish speech recognition model based on facebook/wav2vec2-large-xlsr-53 on the Common Voice dataset
Speech Recognition Transformers
W
patrickvonplaten
14
2
Wav2vec2 Large Xls R 300m Br D10
Apache-2.0
This is a speech recognition model fine-tuned on Breton language dataset based on facebook/wav2vec2-xls-r-300m, achieving a 52.3% Word Error Rate (WER) on the Common Voice 8 test set.
Speech Recognition Transformers Other
W
DrishtiSharma
21
0
Wav2vec2 Large Xls R 300m Hsb V1
Apache-2.0
This is an automatic speech recognition model fine-tuned on the Upper Sorbian (HSB) dataset based on facebook/wav2vec2-xls-r-300m, achieving a word error rate (WER) of 0.4393 on the Common Voice 8 test set.
Speech Recognition Transformers Other
W
DrishtiSharma
20
0
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase